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Abstract 

We describe a probability distribution on isomorphism classes of principally quasi- 
polarized p-divisible groups over a finite field k of characteristic p which can reasonably 
be thought of as "uniform distribution," and we compute the distribution of various 
^— i statistics (p-corank, a-number, etc.) of p-divisible groups drawn from this distribution. 

pH It is then natural to ask to what extent the p-divisible groups attached to a randomly 

chosen hyperelliptic curve (resp. curve, resp. abelian variety) over k are uniformly 

S distributed in this sense. This heuristic is analogous to conjectures of Cohen-Lenstra 
i type for char k ^ p, in which case the random p-divisible group is defined by a random 

matrix recording the action of Frobcnius. Extensive numerical investigation reveals some 
cases of agreement with the heuristic and some interesting discrepancies. For example, 
plane curves over F 3 appear substantially less likely to be ordinary than hyperelliptic 
curves over F 3 . 

00 

1 Introduction 

o 

{N) Let q be a power of a prime p, and let C/F q be a genus g hyperelliptic curve with affine 

equation 

y 2 = f(x) 

where / is chosen at random from the set of monic squarefree polynomials of degree 2g + 1 . 
We can think of F q (C) as a "random quadratic extension of F,j(t)," and ask about the 
probability distribution (if there is one) on arithmetic invariants of C. 

For example: if t is an odd prime not equal to p, one can ask about the distribution of the 
^-primary part of the ideal class group Cl(F g (C)) - or, equivalently, the group of F g -rational 
points of the ^-divisible group J(C)[£°°]. The distribution of C1(F 9 (C))^°°] is the subject 
of the Cohen-Lenstra conjectures [CL84], which predict that the probability distribution on 
the isomorphism classes of Cl(Fq(C))[£°°] approaches a limit, the so-called Cohen-Lenstra 
distribution, as g — > oo. More precisely, Cohen and Lenstra proposed this conjecture for the 
class groups of quadratic number fields, but it was quickly understood (see e.g., [FW89]) 
that the underlying philosophy was just as valid for hyperelliptic function fields. 
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Much less effort has been devoted to the case where I = p, perhaps because this question 
about function fields has no obvious number field analogue. Nonetheless, it is quite natural 
to ask whether the p-adic invariants of random hyperelliptic curves over F q (or random 
curves, or random abelian varieties) obey statistical regularities. For example: 

What is the probability that a random hyperelliptic curve has ordinary Jacobian? 

More precisely: let P (q,d) be the proportion of the q d — q d ~ l monic squarefree polyno- 
mials f(x) over F q of degree d such that the curve Cf with equation y 2 = f(x) has ordinary 
Jacobian. Then we ask: does lim^oo P (q, d) exist, and if so, what is its value? Of course 
one can ask similar questions about other invariants of the p-divisible group of Jac(Cy), such 
as a-number, p-rank, Newton polygon, or final type. 

One can interpret the Cohen-Lenstra conjecture as an assertion that the ^-divisible group 
of Jac(C/) behaves like a "random principally polarized ^-divisible group over Fq." (This 
point of view begins with Friedman and Washington [FW89] and has subsequently been 
refined by Achter [Ach06], Malle [MallO], and Garton [Garl'2].) A principally polarized 
^-divisible group of rank 2g over F q is the same thing as an abelian group isomorphic 
to (Qi/Zip) 29 , equipped with a nondegenerate symplectic form uj and an automorphism F 
satisfying u(Fx, Fy) = qu(x, y). In other words, F is chosen from a certain coset of Sp 29 (Z£) 
in GSp2 9 (Z^). Choosing F at random with respect to Haar measure then specifies a notion 
of "random ^-divisible group of rank 2g over F q " which allows us to compute notions like 
"the probability that a principally polarized ^-divisible group of rank 2g over F q has no 
nontrivial F 9 -rational point." Moreover, as g goes to infinity, this probability approaches a 
limit; when q is not congruent to 1 mod £, the limit is 

oo 

Ila-o 

1=1 

which is precisely the Cohen-Lenstra prediction for the probability that Jac(C/)[£°°](F 9 ) is 
trivial. 

In the same way, one might ask whether the p-divisible group of Jac(Cy) is in any sense 
a "random principally quasi-polarized p-divisible group over fc." The first task is to define 
this notion. A p-divisible group over F q is determined by its Dieudonne module, a free 
Zq := VF(Fq)-module with some extra "semilinear algebra" structures. It turns out that 
there is a natural correspondence between principally quasi-polarized Dieudonne modules of 
rank 2g over Z q a certain double coset of Sp 2s (Z g ) in the group of Z p -linear transformations 
of Z q 9 . From this description one obtains a probability measure on principally quasi- 
polarized Dieudonne modules, and thus on principally quasi-polarized p-divisible groups 
over k. In §3 we study the statistics of several natural invariants of random principally 
quasi-polarized p-divisible groups, and show that these approach limiting distributions as g 
goes to infinity. For example, we prove 
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Proposition 1.1. The probability that a random principally quasi-polarized p- divisible group 
of rank 2g over F q has a-number r approaches 

oo r 

i=i i=i 

as g — >• oo. In particular, the probability that a random principally quasi-polarized p- divisible 
group of rank 2g over F q is ordinary approaches 

oo oo 
J](l + g-)-l = JJ(1 _,!-%■) 
i=l j=l 

as g — y oo. 

We call the infinite product in (1) the Malle-Garton constant, since it is the same 
constant that occurs in the work of the two named authors on conjectures of Cohen-Lenstra 
type over fields containing a pth root of unity. 

We also compute some statistics for the p-rank and the group of Fg-rational points of 
a random principally quasi-polarized p-divisible group G; for instance, we find that the 
probability that the p-rank of G is g — 1 (one smaller than maximum) is 

oo 

q-'Wii + q-T 1 
i=i 

and the probability that the group of Fq-rational p-torsion points has dimension r (as an 
Fp-vector space) is exactly the Cohen-Lenstra probability 

r oo 
i=l j=r+l 

The notion of "random p-divisible group" having been specified, it remains to ask whether 
the p-divisible groups of random hyperelliptic Jacobians act like random p-divisible groups. 
In §4, we gather some numerical evidence concerning this question. The results are in some 
sense affirmative, but display several surprising (to us) features. 

For instance, the probability that a random hyperelliptic curve over F3 is ordinary does 
not appear to approach the value given in (1). Rather, it is apparently converging to 2/3. 
However, it does not seem that lim^oo P (q, d) = 1 — 1/q in general. For instance, P (5, d) 
appears to be converging to (1 — 1/5)(1 — 1/125) = 0.7936, which is a truncation of the second 
infinite product in (1). For larger q, the difference between P a (q,d) and n£i(l + <7~ l ) 
is too small to detect reliably from our data. We have no principled basis to make a 
conjecture about the precise value lim^oo P (q, d) but our data is certainly consistent with 
the hypothesis that the limit exists, and that 

logpim^Poforf) - a=i(i +P-T 1 ) 

logp 
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(1) 



(2) 



(3) 



goes to — oo as p grows. 

One might speculate that the discrepancy between experiment and heuristic is a result of 
our restriction to hyperelliptic curves. What if we consider random curves, or even random 
principally polarized abelian varieties, more generally? The moduli spaces M g and A g are 
both of general type for large g, making it hopeless to sample curves or abelian varieties truly 
at random. But one can at least study various rationally parametrized families. We find that 
the proportion of ordinary plane curves over F3 is indistinguishable from n&^i(l + 
In other words, plane curves over F3 are substantially less likely than hyperelliptic curves 
to be ordinary. We have no explanation for this phenomenon. The proportion of ordinary 
plane curves over F2 does not seem to be n£i(l + , the data is consistent, though, 

with the hypothesis that (3) holds for plane curves as well as hyperelliptic curves. 

In §4, we discuss the geometry of various strata in the moduli space of hyperelliptic 
curves, and what relationship the statistical phenomena observed in §4 bear to the geometry 
of these spaces. 
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2 From ^-divisible groups to Dieudonne modules 

Let k be a perfect field of characteristic p. We briefly recall the classification of finite 
flat group schemes of p-power order and of p-divisible groups over k afforded by (covariant) 
Dieudonne theory. Some general references on these topics are [Dem72], [Fon77], [Gro74, II- 
III] and [Tat67]. Readers already familiar with this story may skip immediately to the next 
section. 

Let G be a finite commutative fc-group scheme of p-power order. 1 We denote by G v 
the Cartier dual of G, and note that one has a canonical "double duality" isomorphism 
G vv ~ G. We write F G : G and V G : -»• G for the relative Frobenius and 

Verscheibung morphisms, respectively, and when G is clear from context we will simply 
write F r and V r for the r-fold iterates of relative Frobenius and Verscheibung, defined in 
the obvious way. 

Note that by definition, the composition of F and V in either order is multiplication by 
p. Since k is perfect, there is a canonical decomposition of G into its etale, multiplicative, 
and local-local subgroup schemes 

G = G 6t x k G m x k G 11 , (4) 

which is characterized by: 

• G et is the maximal subgroup scheme of G on which F is an isomorphism. 

The order of G is by definition the dimk(A) where G = Spec(A). 
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• G m is the maximal subgroup scheme of G on which V is an isomorphism. 

• G 11 is the maximal subgroup scheme on which F and V are both nilpotent (in the 
sense that F n = V n = for all n sufficiently large). 

For * G {et, m, 11}, the formation of G* is functorial in G, and there are canonical identifica- 
tions (G u ) v ~ (G v ) u , (G m ) v ~ (G v ) 6t and (G 6t ) v ~ (G v ) m . 

For a group G of p-power order that is killed by p, the nonnegative integers 

/ = /(G) := log p (ord(G et )) and a = a(G) := dim fc Hom gps/fc (a p , G) 

are called the p-rank and a-number of G, respectively, where a p = Spec(k[X]/X p ) is the 
unique simple object in the category of p-power order groups over k which are killed by p 
and are of local- local type. 2 We say that G is of a-type if G ~ a™ for some m > 1. 

Remark 2.1. Let G be a fc-group of p-power order that is killed by p. 

1. For any extension k'/k contained in k, one has a(G) = dim^ Hom g p S/ /^/(a p , Gy) so the 
a-number is insensitive to algebraic extension of k. 

2. A finite A:-group of p-power order that is killed by p is of a-type if and only if its 
relative Frobenius and Verscheibung morphisms are both zero. It follows easily from 
this that G has a unique maximal subgroup of a-type, which we denote by G[F, V]. 
We then have G[F, V] ~ o^ G \ so a(G) is the largest integer m for which there exists 
a closed immersion a™ > G of /c-group schemes. 

3. As the inclusion G et {k) <->■ G(k) is an equality, the p-rank of G is the nonnegative 
integer / for which |G(/c)| = p-> , and one will often see the p-rank of G defined this 
way. 

4. In the special case that G ~ G v , one has G m ~ (G v ) m ~ (G 6t ) v , so also 

/(G) = log p (ord(G m )) = dim Fp Hom gps/S (/, p , G^). 

The right side of the equation above is often used as a definition of /(G) in the 
literature, since then the definitions for p-rank and a-number look more alike. 

Attached to G is its (covariant) Dieudonne module, D(G), which is a finite length 
W (k)-module equipped with a u-semilinear additive map F: D — )■ D and a cr _1 -semilinear 
additive map V : D — )■ Z) which satisfy -FF = = p. We view D(G) as a module over the 
Dieudonne ring: this is the (generally) noncommutative ring generated over W(k) by 
two indeterminates F and V which satisfy the relations -FA = a(X)F, VX = a~ 1 (X)V and 
FV = VF = p for all A G W(k). 

2 In the definition of a-number, we view the Horn group as a left module over End(a p /fc) = k in the obvious 
way. 
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If D is any (left) A^-module of finite W(&)-length, we define the dual of D to be the 
A fc -module L> v := Rom w(k) (D,W(k)[l/p]/W(k)) with 3 F D v := V% and V D v = F%. One 
checks that the expected double duality isomorphism D yy ~ D (as left Afc-modules) holds. 
The main theorem of classical Dieudonne theory is: 

Theorem 2.2. The functor G ~~> D(G) from the category of commutative finite k- group 
schemes of p-power order to the category of left A k -modules of finite W(k) -length is an 
exact equivalence of abelian categories. Moreover: 

1. The order of G is p v where v = length^j.) D(G). 

2. There is a natural isomorphism of left A^-modules D(G V ) ~ D(G) V . 

3. The canonical decomposition G = G m x G ct x G 11 corresponds to the canonical decom- 
position of D := D(G) 

D = D 6t © D m © D n 
where D* := D(G*) for* G {et,m, 11}. These A^-submodules are characterized by: 

• D ct is the maximal submodule of D on which F is bijective. 

• D m is the maximal submodule of D on which V is bijective. 

• D 11 is the maximal submodule of D on which both F and V are nilpotent. 

4- D(oip) = k with F = V = 0. In particular, if G is killed by p then D(G[F,V]) is 
identified with the intersection kerFnkerV C D(G), so a(G) = dimfc(ker F n ker V). 

By definition, a p-divisible (= Barsotti-Tate) group over k of height h is an 

inductive system G := {(G v ,i v )} v >o of finite /c-group schemes, with G v of order p hv , such 
that 

>■ G v _ ^ G v+ i -?—>■ G v+ i -0 (5) 

is an exact sequence of /c-group schemes for all v > 0. The two prototypical examples are 
Qp/Zp := (Z/p v Z,i v ) and /x p °° := (fi p v,i v ), with i v the obvious closed immersions in each 
case. We will be primarily interested in the p-divisible groups associated to abelian varieties: 
If A/k is an abelian variety, then := (^4[p v ], i v ) is naturally a p-divisible group, with i v 

the canonical closed immersion. We recall that the dual of G is by definition the inductive 
system G v := with j v : G„+i — > G v the unique map 4 satisfying i v o j v = p. The 

double-duality isomorphisms at finite level compile to give a canonical isomorphism of p- 
divisible groups 

G VV ^ Q 

over k. Similarly, the decomposition (4) induces a corresponding 
splitting G = G ct G m x^ G 11 with each G* the inductive system of the G*, and we say 
that G is etale if G = G et and so forth. 

3 For any r £ Aut(fc) and any r-semilinear additive map ^ : D — > D, we define the dual of ^ to be 
the T -1 -semilinear additive map * v : D v ->■ D v whose value on any linear functional L £ D v is the linear 
functional * V (L) := r" 1 oLot. 

4 This map exists as the exactness of (5) forces the multiplication by p map on G v +i to factor through i v . 
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Furthermore, there are natural identifications (G u ) v = (G v ) u , (G m ) v ~ (G v ) ct and 
(G et ) v ~ (G v ) m induced by the ones at finite level. 

A principal quasi-polarization of a p-divisible group G is an isomorphism A : G ^> G v 
with the property that the composition G vv — > G v of the canonical double duality map 
G vv ~ G with A 

coincides with — A v . We say that G is principally quasi-polarized, or pqp for short, 
if it is endowed with a principal quasi-polarization. If G = A[p°°] for an abelian variety A, 
then any polarization of degree prime to p on A induces a principal quasi-polarization on G. 

The Dieudonne module of a p-divisible group G := {(G v ,i v )} is by definition the 
A fc -module D(G) := lim. D(G„). 

i Jv 

It follows easily from Theorem 2.2 (1) and definitions that D(G) is free of rank h as a 
Vl^(A:)-module. If D is any (left) ^-module that is W(fc)-finite and free, we define the dual 
of D to be the A fc -module D y := Rom w{k) (D, W{k)) with F D v ■= and V D v := F%, and 
we define the p-rank and a-number of G to be the corresponding p-rank and a-number of 
G[p] := G\. From Theorem 2.2 we obtain: 

Theorem 2.3. The functor G ~~> D(G) from the category of p- divisible groups over k to 
the category of left A^-modules which are finite and free over W(k) is an equivalence of 
categories. If G = {(G v ,i v )} is a p-divisible group of height h with Dieudonne module 
D = D(G) then: 

1. D is a free W{k)-module of rank h, and F, V uniquely determine each other. 

2. There is a natural isomorphism of left A^-modules D(G V ) ~ D v . 

3. The canonical decomposition G = G m x G ct x G 11 corresponds to the canonical decom- 
position of D 

D = D 6t © D m © D 11 
where D* := D(G*) for * £ {et,m, 11}. These Ak-submodules are characterized by 

• D ct is the maximal submodule of D on which F is bijective. 

• D m is the maximal submodule of D on which V is bijective. 

• D 11 is the maximal submodule of D on which F and V are topologically nilpotent. 

4- G is pqp if and only if there exists a symplectic (= perfect, bilinear, alternating) form 
ipG- D x D ^W{k) satisfying tl)a{Fx,y) = a{ipa{x,Vy]). 

5. Set D = D/pD = D(G[p]) and let F := F mod p and V := V mod p. Then the p-rank 
of G is the "infinity rank" of F, i.e. the k-dimension of the maximal subspace of D 
on which F is bijective. The a-number of G is the k-dimension o/ker F n ker V. 
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In view of Theorems 2.2 and 2.3, we will often abuse terminology and say that a 
Dieudonne module has some property if the corresponding p-divisible group or finite group 
scheme has this property, and vice-versa. 

We will be interested in the category BTi of commutative group schemes G over k of 
p-power order that are killed by p which arise as the p-torsion in a Barsotti-Tate group over 
k. This is equivalent to the condition that the sequence 

G G^ -%~ G (6) 

is exact. An object of BTi is called a truncated Barsotti-Tate group of level 1, or a 

BTi for short. We will write DBTi for the category of finite F g -vector spaces equipped with 
semilinear additive maps F and V satisfying FV = VF = as well as im(i ? ) = ker(y) and 
ker(i ? ) = im(V). We note that the Dieudonne module functor restricts to an equivalence 
of categories BTi — > DBTi, and that, for example, \i v and Z/pZ are BTi's whereas a p is 
not. A principal quasi-polarization of a BTi is a homomorphism A: G — > G y with the 
property that the induced bilinear form ip : D(G) xD(G) — > k on the Dieudonne module of G 
is symplectic. A pqp DBTi is simply the Dieudonne module of a principally quasi-polarized 
BTi. 

Remark 2.4. The condition that ip is symplectic implies that A is an anti-selfdual isomor- 
phism, i.e. that A v coincides with —A via the double duality identification G vv ~ G. If 
char(fc) 7^ 2, then these two conditions are in fact equivalent. In characteristic 2, however, 
there exist anti-selfdual isomorphisms A : G — > G y which do not induce a symplectic form on 
the Dieudonne module, basically because there are (skew) symmetric forms in characteristic 

2 which are not alternating. In general, if G is a p-divisible group over k prolonging G, then 
any principal quasi-polarization on G induces a principal quasi-polarization on G. Our def- 
inition of a principal quasi-polarization on a BTi is identical to that found in [MooOl, §2.6] 
and [OorOl, §9.2]. 

3 Statistics of random Dieudonne modules 

In this section we define a probability distribution on the isomorphism classes of pqp 
Dieudonne modules (D, F,V,lo) of rank 2g over Z g . To this end, we fix D = Z 2 q 9 , with 
standard basis {e\, . . . ,e2 g } and corresponding standard symplectic form u, and we will 
think of F and V as the entities to be chosen randomly. We will say that a cr-semilinear 
endomorphism F: D — > D is p-autodual if there exists a <r~ ^semilinear endomorphism 
V ' : D — ^ D such that the quadruple (D, F, V, uj) is a pqp Dieudonne module, or equivalently 
if FV = VF = p and uj(Fx, y) = o~(u)(x, Vy)) for every x,y £ D. Observe that V is uniquely 
determined by F, if it exists. 

For simplicity of notation we say "cr-endomorphism" (resp. "a-automorphism") to mean 
"(7-semilinear endormorphism" (resp. "cr-semilinear automorphism"). Suppose given a p- 
autodual endomorphism F of D. We note first of all that FD C D is a subgroup of index 
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q 9 containing pD; in particular, the quotient FD/pD is a subgroup of D/pD isomorphic to 
(F q ) g . For any x,y £ D, we have 

lo(Fx, Fy) = a(uj(x, VFy)) = pa(uj(x, y)) 

so that FD/pD is an isotropic (whence maximal isotropic) subspace of the symplectic F q - 
vector space D/pD. We denote D/pD by W. 

Let Fo be a p-autodual endomorphism of D. We define F(D) to be the double coset 
Sp(D)Fo Sp(-D), where Sp(D) is the group of (Zg-linear) symplectic similitudes. Then T(D) 
is endowed with a probabilty measure by pushforward from Haar measure on the group 
Sp(D) x Sp(-D). Note that every element F of J-(D) is in fact ap-autodual cr-endomorphism 
of D. We now show that all p-autodual u-endomorphisms arise in this way, which means 
that J-(D) is independent of our original choice of Fq. 

Proposition 3.1. F{D) is the set of p-autodual a-endomorphisms of D. 

Proof. Let F be a p-autodual cr-endomorphism of D. Then FD/pD is a maximal isotropic 
subspace of W. By the symplectic version of Witt's Theorem [Art 57, Theorem 3.9], Sp(W) 
acts transitively on the maximal isotropic subspaces and since Sp(D) surjects onto Sp(PF), 
we can choose g G Sp(-D) such that gF^D/pD = FD/pD. Since FD and gF^D both contain 
pD, these two subgroups of D are actually equal. Now F induces an isomorphism from 
D to FD, so its inverse i 7-1 can be thought of as an isomorphism from FD to D. Thus 
the composition F~ 1 gF$ is a Z q -linear automorphism of D which preserves oj; i.e., it is an 
element g' of Sp(-D). This shows that F lies in Sp(D)Fo Sp(-D), as claimed. □ 

By a random pqp Dieudonne module (D, F, V, uj) of dimension 2g we mean one 
in which F is chosen randomly from F{D) with respect to the above probability measure, 
and V is determined from F. When there is no danger of confusion we denote (D,F,V,uj) 
simply by D. 

If X is a statistic of Dieudonne modules (such as a-number, or dimension of local-local 
part) we denote by £ g (X) the expected value of X(D) where D is a random pqp Dieudonne 
module of rank 2g. The statistics of interest to us are those where £ g (X) approaches a limit 
as g — > oo; in this case we denote the limit by £{X) and refer to it as the expected value 
of X for a random pqp Dieudonne module, the "large g limit" being understood. If P is a 
true-or- false assertion about Dieudonne modules, the "probability that a random Dieudonne 
module satisfies P" is understood to mean the expected value of the Bernoulli variable which 
is 1 when P holds and otherwise. 

We define J~{D) to be the reduction of J~{D) modulo p, with its inherited probability 
distribution. In other words, if Fq is the reduction of Fq to a u-endomorphism of W, then 
F(D) is Sp(W)Fo Sp(W), with the probability distribution obtained by pushforward from 
the Haar measure (i.e., the counting measure) on Sp(VF) x Sp(W). If F is an element of 
J-(D), then F is a cr-endomorphism of W whose kernel is a maximal isotropic subspace 
of W. The reduction u; of to provides an isomorphism A : W — > W y between W and its 



9 



F„-linear dual, and we set V := A" 1 o F v o A. Note V is then a a -endomorphism of W 
satisfying VF = FV = 0, and that in fact (W, F, V, w) is a pqp DBTi. 

By a random pqp DBTi we mean a DBTi (W, F, V), endowed with the sympletic form 
uj, obtained as above by choosing a random element of J-(D). Statistics of random pqp 
DBTi are again understood to be computed in the large g limit. 

In this paper, we will restrict our attention almost entirely to invariants of p-divisible 
groups which depend only on the associated DBTi, such as a- number and p-corank. It 
should certainly be possible to compute the statistics of more refined invariants (e.g., New- 
ton polygon) but with the aim of avoiding ungrounded speculation in the context of abelian 
varieties, we have mostly restricted ourselves to invariants for which we have collected sub- 
stantial experimental data on Jacobians on curves. 

The kernel and image of F are maximal isotropic subspaces of W of dimension g; we 
denote by F' the cr-isomorphism from W/ ker F to FW induced by F. The following propo- 
sition provides a useful description of a random pqp DBTi i n terms of ker F, im F, and 
F' . 

Proposition 3.2. Let (W, F, V,uJ) be a random pqp DBTi. Then (ker F, im F) is uniformly 
distributed on the set of pairs of maximal isotropic subspaces of W , and F' is uniformly 
distributed among a -isomorphisms from (W/Wi) to W%. 

Proof. The action of Sp(II / ) x Sp(H^) is transitive on pairs of maximal isotropic subspaces, 
and the probability distribution on F(D) is invariant under this action; this gives the first 
assertion. Now suppose that we condition on ker F = W\ and imF = W2; let F(D, W\, W2) 
be the subset of F(D) satisfying this condition. Then J r (D,Wi,W2) is still invariant un- 
der left multiplication by the subgroup of Sp(VF) preserving W2; this subgroup is in fact 
isomorphic to GL(W^) an d permutes the choices of F' transitively. This yields the second 
assertion. 

□ 

The definitions given here may seem somewhat unsatisfactory; our "random DBTi" is 
in some sense more like "a random DBTi with a choice of Z^-basis." We show below that 
our definition conforms with a more intrinsic definition of random DBTi. The groupoid 
formalism used here will not return again until the proof of Proposition 3.10. 

Definition 3.3. Let G be a finite groupoid; that is, G is a groupoid with finitely many 
isomorphism classes of objects and finite Horn sets. The uniform distribution on G is the 
unique distribution on isomorphism classes of objects whose mass on an isomorphism class c 
is inversely proportional to the number of automorphisms of an object in c. We say a finite 
set of objects S in G is uniformly distributed in G if the probability that a random element 
of S lies in an isomorphism class c is given by uniform measure. 

The desirability of counting objects with weights inversely proportional to the size of 
their automorphism group has been known at least since Siegel's mass formula; as regards 
general groupoids we learned the formalism from Baez and Dolan. 
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It is clear that an equivalence between groupoids G\ and G% carries uniform measure on 
G\ to uniform measure on (This is just the groupoid version of the fact that a bijection 
of sets transports counting measure from one to the other.) Similarly, if S is a finite set 
with a free action of a group T, the pushforward from S to S/T of uniform measure on S is 
uniform measure on S/T. The following easy proposition records the fact that the same is 
true in the groupoid setting. 

Proposition 3.4. Let S be a finite set, let T be a finite group acting on S, and let S/T be 

the groupoid whose objects are S and whose morphisms from s to s' are group elements 7 in 
T such that 7 • s = s'. Then the objects of S are uniformly distributed in S/T. 

Proof. The probability that a random s in S lies in the T-orbit Tsq of a fixed sq G S is 
precisely 

\Ts \/\S\ = l/Aut s/r (s ). 

□ 

We now explain how this formalism applies in the present context. Let (W,uJ) be a 
F 9 -vector space of dimension 2g endowed with a nondegenerate symplectic form. Let S be 
the set of u-endomorphisms F: W —> W whose kernel is a maximal isotropic subspace of W. 
Note that S is in bijection with the set of triples (Wi, W2,F') described in Proposition 3.2; 
in particular, a random DBTi of rank 2g is the same thing as a random element of S in 
uniform distribution. 

Now the group T = Sp 2g (F 9 ) acts on S by changes of basis preserving the symplectic 
form. And the groupoid S/T is equivalent to the category of (Dieudonne modules of) 
principally quasi-polarized BTi's. Thus, a random pqp DBTi, in our sense, is a random 
principally polarized DBTi m the sense of Definition 3.3. 

The above discussion is rather formal, but we will see that the groupoid viewpoint is 
quite convenient in the proof of Proposition 3.10. 

The a-number of a random pqp DBTi 

Let (D,F,V) be a DBTi over F q . By Theorem 2.2 (4), the a-number of D is the k- 
dimension of the intersection kev FnkerV C D; by definition of the category DBTi we have 
ker V = imF, so also a{D) = dimfc(ker F n miF). 

Proposition 3.5. The probability that a(D) = r is 

00 r 

1=1 1=1 

Proof. The a-number does not depend on F' , so we are computing the probability that 
two random maximal isotropic subspaces of a large symplectic space over F q intersect in a 
subspace of dimension r. 
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Let W be a 2<7-dimensional symplectic space. The number of maximal isotropic subspaces 
in W is 



SP2 9 ( F «?)I _ q/2(<7 2 +fl) 1 S P2g( F p) 



(7) 



g (i/2) fl (ff+l)|GL fl (F g )| H q 2 9 2 +9 \GL g (F gJl 

By the symplectic version of Witt's Theorem, the symplectic group Sp(VF) acts tran- 
sitively on the pairs of maximal isotropic subspaces with r- dimensional intersection; so to 
count the number of such pairs, it suffices to compute the size of the stabilizer of such a pair 
in Sp(W). Suppose for instance that the pair is given by 

V\ = (ei, . . . ,e g ), V2 = (ex, ... ,e r , e g + r +i, ■ ■ ■ , e2 9 ). 

Then the stabilizer of the pair (Vi, V2) is the group of matrices of the form 

A B 

(A T y 1 

where A lies in the parabolic subgroup preserving (ei, . . . , e r ) and B is symmetric, having 
zero entry when i > r and j > g + r. 
The order of this group is 



g2+m |GL r (F g )||GL g _ r (F g ) 
1 ^ q( 



j(9-r) 2 



so the number of pairs of maximal isotropics with r-dimensional intersection is 

j(9-r) 2 |Sp 2 „(F- 



\GL r (F q )\ |GL 9 _ r (F g )| q 29>+9 • 1 ; 



Dividing (8) by the square of (7) yields 

_ m qia-rf / |GL g (F p )| \ 2 

q |GL r (F p )||GL,_ r (F g )| V q9 2 J |Sp 2g (F 9 )| 

which, as g goes to infinity with r fixed, approaches the desired quantity. □ 
Remark 3.6. 

(1) We note that this prediction is in keeping with the fact that the locus of abelian 
varieties with a-number at least r in A g /F q has codimension )• 

(2) The work of Poonen and Rains [PR12] posits that the mod p Selmer group of a random 
quadratic twist of a fixed elliptic curve should be distributed like the intersection of 
two random maximal isotropics in an orthogonal vector space. They show that the 
mod p Selmer group actually does arise as the intersection of two maximal isotropics 
- the question, then, is whether these isotropics are in fact "uniformly distributed" 
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in an appropriate sense. Our situation is similar; the o-number of a pqp p-divisible 
group is indeed the dimension of the intersection of the two maximal isotropics FW 
and VW in the symplectic vector space W, and one is asking whether these maximal 
isotropics are distributed uniformly when W arises from an abelian variety. 

(3) The conjectured distribution of the o-number is the same as the distribution on the 
dimension of the fixed space of a random large symplectic matrix over F q , which was 
computed in an unpublished work by Rudvalis and Shinoda [R£ \] (see [FulOO] for 
a review of their results and an alternative proof). This distribution also appears in 
the conjectures of Malle [MallO] and Garton [Garl2] as the conjectured distribution 
of p-ranks of ideal class groups of number fields containing pth roots of unity. In 
the class group context, the relationship with the fixed space of a random symplectic 
matrix is motivated by the analogy between number fields and function fields, where 
the symplectic matrix describes the action of Frob^ on p-adic cohomology. 

The a-number of D is if and only if D is ordinary. We thus have the following corollary. 
Corollary 3.7. The probability that a random pqp Dieudonne module is ordinary is 

oo 

Uil + q-T 1 . 

i=i 

In problems of Cohen-Lenstra type, it is often the case that moments of variables have 
nicer formulae than probability distributions do. The present situation is no exception. 

Proposition 3.8. Let X m (D) be the number of closed immersions of group schemes over 
F q from a™ to the p-torsion in the p-divisible group associated to D. Then £X m (D) = q^' 2 ' . 

Proof. In the language of the present paper we claim that 

X m (D) = (q a ^ - l)(q a ^ -q)... (q a ^ - q^ 1 ). 

Indeed, if G is the p-divisible group attached to D then any closed immersion a™ G\p] of 
group schemes necessarily factors through the maximal a- type subgroup scheme G[F, V] ~ 
ap of G\p] (see Remark 2.1 (2)). In particular, such closed immersions are in bijection 

with closed immersions a™ oip^ G \ which are in bijection with injections F™ <^-> Fq G ' via 
the exact functor D(-), thanks to Theorem 2.2 (4). 

Because the distribution of X agrees with the distribution on the dimension of the fixed 
space of a random matrix g in Sp(PF) (see Remark 3.6 (3)), it suffices to show that the 
number of injections from an m-dimensional vector space into the fixed space of g has the 
desired expected value. By Burnside's Lemma, this is the same as the number of orbits of 
Sp(W) acting on the set of injections i: FJ? ^ W. By the symplectic version of Witt's 
Theorem, two such injections are in the same orbit if the symplectic forms and 
i^Q agree; so the number of orbits is just the number of alternating bilinear forms on an 

m-dimensional vector space, which is 0(2) as claimed. □ 
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The p-corank of a random pqp DBTi 

Suppose that X(D) is a statistic which is invariant under symplectic change of basis, i.e., 
under conjugation of F by Sp(IF). As above, by Witt's Theorem all pairs of maximal 
isotropic subspaces with intersection dimension r are in the same orbit of the symplectic 
group. Thus, to compute the expected value of X(D) conditional on a(D) = r, it suffices 
to compute the expected value of X(D) for a fixed choice of W\ and W 2 , and F' chosen 
uniformly from the cr-isomorphisms from W/W\ to W 2 . The composition 

<j>: W 2 > W W/Wt — ^ W 2 

is then a u-endomorphism of W 2 of rank g — r, and in fact is chosen uniformly from the set 
of such d-endomorphisms. 

When W 2 = imF, the <7-endomorphism <j) is just the map FW — > FW induced by F. 
In particular, the p-corank of the p-divisible group attached to D is precisely the corank of 

0°°. 

Proposition 3.9. Let < r < s be integers. Then the probability that the a-number of D 
is r and the p-corank of D is s is 

00 s— 1 s—r 

q-r^+r-s JJ (1 + g -i r l JJ (1 _ g - t) JJ (1 _ q -iy K (Q) 

i=l i=r i=l 



Proof. We first show that the probability that a random a-endomorphism of a ^-dimensional 
vector space V has rank g — r approaches 

00 r 

q~ r2 II (l-OB-'t 1 (10) 

i=r+l i=l 

as (7 goes to infinity. Indeed, the map <f> 1— > (V — > im</>, im^>) defines a bijection between 
CT-endomorphisms V — > V of rank g — r and pairs (4>,W) where W G Gr SiS _ r (F (? ) is a 
subspace of V of dimension g — r and ip : V — > W is a <r-semilinear surjection. For a given 
such IF, there are 

9 f[(q9-j) (11) 

i=0 

such <7-semilinear surjections (indeed, the cr-semilinear surjective maps V — > W correspond 
to surjective linear mappings cr*(V) — > IF, and c*(V) is an F g -vector space of dimension g). 
Considering the stabilizer of the natural action of GL 9 on Gr Si9 _ r shows that 

\ Cr (F)\ = I GL g( F q)l (12) 

1 9 ^ q)l |GL r (F 9 )||GL ff _ r (F s )||M T>fl _ f .(P ff )|- 1 ' 
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Dividing the product of (11) and (12) by | M g (F q )\ yields 

IK" V ~ g*) <f* q^-r? |QL g (F g )| 

q q(9-r) 9 |GL P (F ff )| |GL g _ r (F ? )| q a 2 

which, as g goes to infinity with r fixed, approaches the quantity of (10). 

By [Holll], for integers g > s > r > the number of a-endomorphisms of V such that 
rank(M) = s and rank(M°°) = r is 

r ~}(„9 _ J} \ I ("IT „ JF^\ ( U s r r :- l (a9-r-l _ <n\ n r(g-r) 



m=o (q 9 -q i ))\ GL g _ r (F q )\ (jigrv^: : q 

| GL s _ r (F 9 )| • | GL 3 _ s (F g )| • | M s „ r , 9 _ s (F g ) 

It follows that the probability that a random u-endomorphism of V has rank(M°°) = g — s, 
conditional on rank(M) = g — r approaches 

s— 1 r s—r 

i=r 8=1 i=l 

as 5 approaches infinity. Multiplying (13) by the constant in Proposition 3.5 yields the 
desired result. 

□ 

Summing (9) with s fixed and r between 1 and s gives the probability that the p-corank 
of D is s; for example, the probability that the corank is 1 is q~ l rii=i(l + ■> an d the 

probability that the corank is 2 is (q~ 2 + q~ 3 ) n£i(l + 



The group of F 9 -rational points of a random pqp DBTx 

Let G be a BTi with Dieudonne module W. The group G(F q ) of its F^-rational points is 
the subgroup of G(F q ) fixed by Frob 9 . 

Proposition 3.10. The group of F q -rational points of the group scheme associated to a 
random pqp DBTi has cardinality p d with probability 

d oo 

p-^na-p-T 1 n o--p~ s )- (14) 

8=1 j=d+l 

Proof. We again use the description of (f> (from the proof of Proposition 3.9) as a random 
corank-d a-endomorphism of the ^-dimensional vector space FW. Let (X, Y, (f>x, 4>y) be 
a quadruple where X © Y = FW is a direct sum decomposition, <px is a nilpotent a- 
endomorphism of X with corank d, and 0y is a cr- automorphism of Y. Then (j>x © <?V is 
a corank-ci cx-endomorphism of FW. Conversely, any choice of a corank-d cr-endomorphism 
4> yields a quadruple as above by taking Y to be the subspace of FW on which <j> acts 
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invertibly and X the subspace on which (j> acts nilpotently. So a uniformly chosen corank-d 
(T-endomorphism of FW is the same as a uniformly chosen quadruple (X ,Y, (j>x , 4>y) ■ In 
particular, the action of F on F°°W is precisely 0y, which is drawn uniformly at random 
from the set of <r-automorphisms of Y . 

Now Y is itself a Dieudonne module, on which F is bijective. Fix a positive integer 
N . Let be the category of rank-iV p-torsion Dieudonne modules with F q coefficients on 
which F is bijective, Q p the category of rank- TV" etale group schemes over F q killed by p, and 
J- p the category of N x N matrices over F p . The morphisms from x to y are, respectively: 
isomorphisms of Dieudonne modules from x to y; group scheme isomorphisms from x to y; 
changes of basis intertwining x and y. All three of these categories are finite groupoids and 
all three are equivalent. 

The uniformity of (j)y implies by Proposition 3.4 that Y is uniformly distibuted in D p . 
Thus, the etale group scheme Gy /F q is uniformly distributed in Q p and the matrix My giving 
the action of Frob g on Gy{Fq) is uniformly distributed in T p . It is precisely the dimension 
of coker(My — 1) whose distribution we are trying to study. But applying Proposition 3.4 
again, the distribution of coker(Afy — 1) for My uniformly distributed in F p is identical with 
the distribution obtained by letting My be a random element of the set GLat(F p ). But the 
distribution of dimcoker(My — 1) when My is a random invertible matrix is well-known to 
approach the value (14) as N = dimY —> oo. It is easy to see from Proposition 3.9 that 
dimY is larger than any constant multiple of g with probability 1; this finishes the proof. 

□ 

Corollary 3.11. The expected number of surjections from a random pqp BTi to the constant 
group scheme (Z/pZ) rf is 1. 

We note that the distribution produced here is identical with the Cohen-Lenstra heuristic 
for the distribution of p-ranks of imaginary quadratic fields. This is quite natural when one 
considers the p-torsion in the Jacobian of a random hyperelliptic curve C over a finite field. 
When the field has characteristic p, a heuristic of the form "random hyperelliptic curves 
have random pqp DBTi" would suggest that the finite abelian p-group Jac(C)[p](F g ) is 
distributed according to the Cohen-Lenstra law — in other words, that the conjectural 
distribution of Jac(C)[p](F 9 ) is exactly the same whether C is defined over a finite field of 
characteristic p or of characteristic prime to p (as long as the field contains no pth. roots of 
unity.) 

In the case where C is defined over a finite field k whose characteristic is prime to p, the 
results of [EVW12] prove that the dih moment in Corollary 3.11 is indeed 1 as long as \k\ is 
sufficiently large relative to p and not congruent to 1 mod p. It would be interesting to see 
whether there is any way to adapt the methods of [EVW12] to prove that similar statements 
hold in characteristic p. 
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Other statistics and questions: final types and Newton polygons 

We discuss some further problems which fit into our general framework, but which we have 
not investigated. 

A more refined invariant of a DBTi of dimension 2g is the final type, a g-tuple r = 
(xi, . . . , x g ) of non-decreasing integers such that x\ £ {0,1} and Xi+\ < x\ + 1. Such a 
tuple determines the isomorphism class of the corresponding group scheme over an algebraic 
closure of F q and, conversely, any such tuple arises as the final type of a DBTi; see [Pri08, 
2.3]. We note that, unlike a-number and p-rank, the final type of a DBTi depends on F 
and V, not just their restrictions to the maximal isotropic subspaces kei F and ker^. 

Another invariant of a Dieudonne module D is its Newton Polygon - setting q = p m , the 
Newton polygon of D has, for every root a (counted with multiplicity) of the character- 
istic polynomial of F m , a segment of slope ord p (a)/m; when q = p this is just the Newton 
polygon of the characteristic polynomial of F. The Newton polygon determines D up to 
isogeny; see [Man63, II, §4.1]. We note that, unlike the a-number, p-rank, or the final type, 
the Newton polygon of a Dieudonne module D is not determined by D /pD. The question of 
which Newton polygons are compatible with which final types is a subject of active research 
[Oor04]. 

Questions. 

(1) The p-rank of a Dieudonne module is equal to the number of segments of the Newton 
polygon of slope zero; a natural generalization of Theorem 3.9 is thus the following. 
For A G Q n (0, 1), let m.\(D) be the number of segments of slope A in the Newton 
polygon of D. Does m^(-) converge to a distribution as g — > oo? Moreover, can one 
compute, for a fixed non- negative integer d, the probability that m\(D) = d or the 
average value of m\(D)7 

(2) More generally: the Newton polygon D of a random Dieudonne module has a local- 
local part .D 11 as defined in Theorem 2.2; has rank 2c, where c is the p-corank of D, 
and the Newton polygon of D n has all slopes in the open interval (0, 1). Our expecta- 
tion is that the probability distribution on the Newton polygon of D 11 , conditional on 
the p-corank of D being c, should be given by the probability distribution on Newton 
polygons of nilpotent p-autodual matrices on Zp C . We expect that one can compute 
this distribution by force for small c. 

(3) One can generalize either of these questions by picking, for each g, a subset S g of the 
set of possible Newton polygons (resp., final types) and asking for the proportion of 
Dieudonne modules whose appropriate invariant lies in S g . Of course, some conditions 
on S g will be necessary to ensure that the proportion approaches a limit. An example 
where we expect a positive answer would be that in which S g is the set of final types 
with Tg^i = g — 1 — s for a fixed integer s. 
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4 Random curves, random abelian varieties, and random p- 
divisible groups 



So far, the content of this paper has been purely combinatorial; we have computed moments 
and distributions of various statistics on random pqp BTi's and random p-divisible groups. 
In practice, pqp p-divisible groups typically arise from motives. In this section, we address 
the question of whether p-divisible groups arising from random members of a family of 
abelian varieties are random p-divisible groups in the sense of (15) below. 

Let M\,Mi, ... be a family of schemes (or Deligne-Mumford stacks) and let A\ be an 
abelian scheme over Mj. The three cases we will consider are: 

• M g = "Hg, the moduli space of hyperelliptic genus g curves, and A g the Jacobian of 
the universal curve; 

• M g = M g , and A g the Jacobian of the universal curve; 

• Mg = A g , and A g the universal abelian 5-fold. 

We say that the p-divisible groups associated to such a family are "random" with respect 
to a statistic X if 

hm ^ 9 ^) x (y^) = £X (15) 

Which of these families, with respect to which statistics, yield random p-divisible groups? 
In this section we discuss the numerical evidence concerning this question, and some geo- 
metric properties of strata of moduli spaces in characteristic p which seem closely related to 
the statistics in the first part of the paper. 



Relation with geometry of moduli spaces 

In this short section we record some remarks about the relationship between the heuristics 
presented here and the cohomology of moduli spaces of curves and abelian varieties in 
positive characteristic. There are no theorems in this section, only suggestive relationships 
between conjectures. 

The prediction that the mod p Dieudonne module of the Jacobian of a random hyperel- 
liptic curve over F q is a random pqp DBTi implies, in particular, that 

00 

lim H n g°(F q )/H g (F q ) -+ 1 - IT(1 + q-T 1 = V? + V? 3 + V? 4 + • • • (16) 

i=l 

where Hg° denotes the non-ordinary locus, a divisor in % g . (One can make an analogous 
guess with A4 g in place of T-L g ) Thus, the heuristic goes hand in hand with a belief that the 
non-ordinary locus is an irreducible divisor, at least for all sufficiently large g - otherwise, 
the ratio would have leading term n/q instead of 1/q, where n is the number of F^-rational 
components of Tig 10 - 



18 



We emphasize that almost nothing is known about the irreducibility of the non-ordinary 
locus in Tig or M. g (see [AP08, §3.2] and [AP11, §3.7]). The non-ordinary locus in A g , by 
contrast, is known to be irreducible. 

For a family of curves over F q with random p-divisible groups, Proposition 3.9 shows 

that the proportion of curves with a-number r and p-corank s has leading term q~( 2 )+ r_s ) 
which suggests that the locus of cuves with a-number r and p-corank s is an irreducible 
locally closed subvariety of codimension C^ 1 ) + r — s. This is in fact the codimension in 
A g of the locus of abelian varieties with a-number r and p-corank s (see [Pri08, 2.3]); so 
the heuristics arising from random Dieudonne modules can be read as supportive of (or 
supported by) the expectation that various natural loci of curves intersect the strata in 
Ag/F q with the expected dimension. 

The heuristic (16) can also be used to make guesses about the cohomology of various 
strata in %g and M g - For example: suppose that the restriction map from the cohomology 
of M g to the cohomology of the closed subscheme were an isomorphism in some large 
range. Then one might expect the ratio M.g°(F q )/ \M g (F q ) to be very close to 1/q, contrary 
to what the heuristic predicts. Proving any implication of this kind is well out of reach - the 
Betti numbers of M g grow superexponentially in g, so even with control of the cohomology 
groups in some large range there is no hope of using Weil bounds to get a good approximation 
to M g (F q ) [dJKOO]. 

For hyperelliptic curves, the situation is a bit more legible. For simplicity, write X n /F p 
for the configuration space parametrizing monic degree- n squarefree polynomials f(x), and 
let be the closed subscheme parametrizing those polynomials such that the hyperelliptic 
curve 

y 2 = fix) 

is non-ordinary. It is easy to check that \X n (F q )\ = q n - q n ~ l when n > 1; moreover, the 
etale cohomology of X n is concentrated in degrees and 1. 
If (16) holds, we would have 

K°(F q ) ~ (Q n ~ f-^Wq + lA/ 3 + V<? 4 + •••) = ~ q n ~ 2 + Q n ~ 3 ■ ■ ■ 

This suggests that X™° has cohomology beyond the classes pulled back from X n ; for instance, 
there should be a cohomology class in some even degree generating a subspace on which 
Frobenius acts with trace g n_3 . Moreover, this class might be expected to vanish when 
charFg = 3, since the numerical data below suggests that in characteristic 3 the proportion 
of non-ordinary hyperelliptic curves is precisely 1 jq. 

Problem. Construct such a class in the locus of non-ordinary hyperelliptic curves. 

Corollary 3.11 says that, on average, a random DBTi admits a single surjection to 
(Z/pZ) d . Thus, in a family of curves X with random p-divisible group, parametrized by 
a moduli scheme M, the average number of surjections from Jac(X)(F g ) to (7i/pZ) d is 1. 
This suggests that the moduli space M p d is irreducible, where M p ^ is the moduli space 
parametrizing curves X in M together with a level structure Jac(X) — > (Z/pZ) d . And this 
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irreducibility for every d suggests that the monodromy representation of the moduli space 
of ordinary curves in M on the g- dimensional space of etale p-torsion in Jac(X) has image 
containing SL 9 (F p ). In fact, one could refine Corollary 3.11 to apply under supplementary 
conditions on p-corank, a-number, etc. — and the result would be a prediction that, on any 
of these p-adic strata, the monodromy in the etale p-torsion of Jac(A) has full image. In fact, 
such theorems are already known in the case M = A4 g [AP08] and M = T~L g [AP11, §3.7]. 
It seems reasonable to hope that the results of the those two papers could be used to prove 
that random hyperelliptic curves and random curves satisfy a weak version of the heuristic 
suggested by Corollary 3.11, where a limit q — > oo is taken prior to the limit g — > oo. This 
would be exactly analogous to the method used by Achter in [Ach06] to derive a similarly 
weakened Cohen-Lenstra conjecture from a large-monodromy theorem in £-adic cohomology. 

Experiments 

The tables below contain experimental information about the distribution of a-numbers and 
orders mod p of Jacobians of hyperelliptic and plane curves. 

The constants appearing in the tables are defined as follows. As in Theorem 3.5 we 
define the constant MG(q, r) to be 

oo r 

MG(g, r) := g-(T) JJ(1 + g-)- 1 J](l - q ~*)-\ 
i=i i=i 

Similarly, for a finite group G of p-power order we define the Cohen-Lenstra probability 

to be 

1 oo 

and, finally, we define the truncated Malle-Garton constant to be 

6 

TMG(g;6) :=n(l-^ 2i )- 

Table 1 contains distributions of a-numbers of Jacobians of hyperelliptic curves; we 
explain below how the computations were done. As noted in the introduction, the data 
suggests that the probability that the Jacobian of a random hyperelliptic curve has a-number 
does not approach the value given by our heuristics. Rather, for q = 3 the data suggests 
that the true probability is 2/3 = TMG(3; 1), and for q = 5 it suggests 0.7936 = TMG(5; 2). 
To verify this, for q = 5 we took exhaustive data for low g (i.e., computed the a-number 
of the Jacobian of every hyperelliptic curve of genus g). For larger g it is unreasonable to 
do an exhaustive computation; for g = 21 we computed the a-numbers of the Jacobians of 
819200000 random hyperelliptic curves, and indeed the proportion which were ordinary was 
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closer to the truncated constant. For q > 5, we did not generate enough data to distinguish 
between the Malle-Garton constant and the truncated variant. 

It is natural to ask what the "truncated" version of MG(g, r) should be for larger values 
of r. For instance, the proportion of hyperelliptic curves over F3 with a-number 1 appears 
to converging to a value 0.296.... What limiting value (presumably a power series in 1/3) is 
suggested by this experimental result? 

Table 2 contains distributions of a-numbers of Jacobians of plane curves. The sample 
sizes are necessairly smaller than those of Table 1 (see the comments in the next section). 

The data for q > 2 agrees well with our heuristics, and in particular the truncation phe- 
nomenon disappears (or the discrepancy from heuristics is too small for us to measure.) For 
q = 2, the data does not agree with our heuristics, and for this fact we have no conceptual 
explanation. In particular, we do not see an explanation for this discrepancy along the lines 
of Theorem 4.2 below. 

Table 3 (resp., Table 4) contains the proportion of hyperelliptic curves (resp., plane 
curves) C such that p \ | Jacc(F g )| (where p = charF 9 ). For q 7^ 2 (resp. p > 2) the data 
is consistent with the heuristics suggested by Proposition 3.10. For C hyperelliptic, since 
one can efficiently compute the zeta function of Jacc (and can thus detect when p exactly 
divides | Jacc*(F g )|) we also report the probability that Jacc[p 00 ](F (? ) = Z/pZ. 

Remark 4.1. We find in Table 4 a notable divergence between experiment and heuristic for 
smooth plane curves in characteristic 2; it appears that for plane curves X of odd degree 
over F q with q = 2 m , the order of Jac(X)(F g ) is almost always even. This was puzzling to 
us until we realized that the behavior had a natural explanation; so natural that in fact we 
can prove that the behavior persists for plane curves of every odd degree. 

Theorem 4.2. Let d be a positive odd integer, let k be a finite field of characteristic 2, 
and let F £ k[X,Y,Z] be a homogeneous degree d form cutting out a smooth curve X in 
P 2 /fc. Suppose furthermore that at least one monomial X a Y b Z c with two odd exponents has 
nonzero coefficient in F. Then | Jac(X)(/c)| is even. In particular, the proportion of smooth 
degree-d plane curves X/k with | Jac(X)(/c)| even goes to 1 as d goes to 00. 

Proof. Let ^1,^2 be distinct linear forms, and let ui be the exact differential d{l\/l2) on X. 
The divisor of 00 is automatically a square, since locally one is just computing the derivative 
of a Laurent series in k({t)), and all such derivatives lie in the field of squares k((t 2 )). Let 
D be the divisor such that 2D = div(w); then D is evidently a half-canonical divisor, and 
its divisor class is independent of the choice of u (see e.g., [SV87, §3].) 

On the other hand, the canonical class on a degree-c? plane curve is (d — 3) times the 
hyperplane class. Thus, the divisor (1/2) {d— 3) div^) is also a half-canonical. The difference 
between these two half-canonicals is a 2-torsion point on Jac(X)(/c); it remains to show that 
this point is nonzero under the given conditions. 
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Note that D — (1/2) (d— 3) div^) is principal if and only if the principal divisor div(w) — 
(d — 3)div(^2) is the divisor of a function / G (k(X)*) 2 . Moreover, a direct computation 
shows that div(w) — (d — 3) div^) is the divisor of the function 

which is a square only if dF jdl\ is the square of a homogeneous form of degree (1/2) (d — 1). 
It is easy to see that this is equivalent to the failure of the condition in the theorem. 

□ 

We remark that the converse to Theorem 4.2 does not hold; even when the two half- 
canonicals constructed in the proof do agree, there is no reason there might not be another 
Fq-rational 2-torsion point on Jac(X). 



Methods of computation 

Let C be a curve over F q and let (W, F, V, uj) be the pqp DBTi associated to the p-torsion 
subgroup scheme of the Jacobian of C. Then there exists a canonical isomorphism W = 
H\^(C) such that the induced action of F (resp., V) on H^(C) is equal to the action 
of Frobenius (resp., the Cartier operator) [Oda69, Section 5], and u agrees with the cup 
product pairing. The actions of F and V respect the Hodge filtration 

- H°(X, fil ) - H^ iC) - H\C, Oc) 0. 

Moreover, the action of F on H°(X, O) is visibly trivial and, dually, V(Hj R (C)) = H°(X, O); 
in particular, there exists a basis of H^o(C) with respect to which the semilinear maps F 
and V correspond to the matricies 



" 


B ' 


and 


" A 


C ' 





D 









(17) 



where A, B,C,D G M g (F q ). Since kerV = imF, it follows that a(W) = dim(kerF n ker V) 
is equal to the nullity of A] by duality this is the same as the nullity of D. 

The matrix A is called the Cartier-Manin matrix of C. When C is a hyperelliptic 
curve with affine equation y 2 = f(x), there exists a basis of H^(C) with respect to which 

A is given by the explicit formula (c p j_j), where f{x) R ^ = ^ CkX k and p = char F g ; see 
[Yui78] for details. In particular, one can quickly compute the Cartier-Manin matrix, and 
thus a- number, of a hyperelliptic curve. 

Moreover, one can efficiently compute | Jacc*(F g )| mod p (where q = p r ) from the Cartier- 
Manin matrix. Indeed, | Jacc(F (J )| = -P(l), where P is the characteristic polynomial of the 
rth power of Frobenius acting on H^ t (C; Qe) for any I ^ p; P has integral coefficients and 
its reduction mod p is equal to the characteristic polynomial of the rth power of Frobenius 
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acting on H^ R (C). Choosing a basis so that the matrix corresponding to F is as in (17), F r 
will be of the form 

B' 
D' 



D' = D-a(D)---a r -\D) 



and it thus suffices to compute the value of the characteristic polynomial of D' at 1. We can 
therefore quickly compute the order of Jacc(F g ) mod p from the Cartier-Manin matrix of C. 



When C is a plane curve, we do not know of an explicit formula for A or D in terms of 
the coefficients of the defining equations of C. There is however an algorithm, implemented 
as Magma's CartierRepresentation function, which, for a particular curve C computes a 
representative of A. This computation is much slower than in the hyperelliptic case; accord- 
ingly, the sample sizes are smaller for plane curves. 



Magma code which performs these computations can be found at the Arxiv page for this 
paper or at the third author's web page [CEZB]. 



Table 1: Distribution of a- numbers of Jacobians of hyperel- 
liptic curves. 



Q 


genus 


Sample Size 


a 


Proportion 


MG(f,o) 


TMG(p; £=i) 


3 


25 


40960000 





0.666716 


0.639005 


0.666666 








1 


0.296272 


0.319502 










2 


0.0328910 


0.0399378 






100 


5120000 





0.666497 


0.639005 


0.666666 








1 


0.296487 


0.319502 










2 


0.0329145 


0.0399378 




5 


5 


exhaustive 





0.793278 


0.793335 


0.793600 




6 


exhaustive 





0.793875 








7 


exhaustive 





0.793557 








21 


819200000 





0.793838 


0.793335 


0.793600 




25 


40960000 





0.793529 


0.793335 


0.793600 








1 


0.198172 


0.198334 










2 


0.00822029 


0.00826392 






100 


5120000 





0.793838 


0.793335 


0.793600 








1 


0.197818 


0.198334 










2 


0.00826679 


0.00826392 




7 


25 


40960000 





0.854542 


0.854593 


0.854594 








1 


0.142490 


0.142432 










2 


0.00295969 


0.00296733 




Continued on next page 
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Table 1 continued from previous page 



(7 


genus 


SritidIp lSizp 


a 


Pronortion 


MG(a a) 


TMGO ! —) 


Q 


9^ 
zu 


1 97^000 


n 
u 


fl 8SRQ7D 
u.oooy / u 


U.OO I UOO 


u.oooooy 








i 
i 


u. luyooo 


u. i ruyo * 










2 


00134582 


00138696 






1 nn 

1UU 


i ^nn 


n 
u 


888774 

U.OOO 1 / ^± 


887fi^ 

U.OO / UtJ(J 


u.oooooy 








1 




1 1 00^7 










2 


00135484 


00138696 






9^ 
ZD 


1Z041UUUU 


U 


n Q^QQR9 

u.yoyyoz 


n Q^QQ^Q 

u.yoyyoy 


n Q^QQ^Q 

u.yoyyoy 








i 
i 


fl 0^00749 

u.uoyy < 


u.uoyyy / o 










2 


6 43987E-5 


6 40985E-5 






100 


1036000 





959822 


959939 


959939 








1 


0.0401110 


0.0399975 










2 


6.75676E-5 


6.40985E-5 




27 


25 


13030000 





0.962955 


0.962914 


0.962963 








1 


0.0369945 


0.0370352 










2 


5.07291E-5 


5.08724E-5 






100 


1044000 





0.962741 


0.962914 


0.962963 








1 


0.0372021 


0.0370352 










2 


5.74713E-5 


5.08724E-5 




49 


25 


20480000 





0.979568 


0.979583 


0.979583 


81 


25 


20480000 





0.987662 


0.987653 


0.987655 


125 


25 


20480000 





0.991984 


0.991999 


0.991999 



Table 2: Distribution of a- numbers of Jacobians of plane 
curves. 



Q 


degree 


genus 


a 


Sample Size 


Proportion 


MG(g,o) 


2 


7 


15 





56949615 


0.426022 


0.419422 








1 




0.422294 


0.419422 








2 




0.109071 


0.139807 




10 


36 





80000 


0.423363 


0.419423 


3 


7 


15 





3062000 


0.638947 


0.639006 








1 




0.319267 


0.319502 








2 




0.0404950 


0.0399378 




8 


21 





249230 


0.638133 


0.639006 




10 


36 





154000 


0.639273 


0.639006 








1 




0.318792 


0.319502 








2 




0.0404481 


0.0399378 


Continued on next page 
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Table 2 continued from previous page 



n 








^ TT1 Tl 1 P> ^T7P 
kJdilllJJlC/ kJlZjC 


rTHnnvl'inTi 

J. lUjJVJl UJAJ11 


1\L\J\ y , Uj ) 


A 


7 


1 ^ 


o 

u 


9729000 
Z / ozuuu 


f) 7^7&DQ 
u./oi ouy 


7^7^1 ^ 








1 




245737 


245837 

V, 7 • I '.. 7 V, 7 *. 7 I 










Li 






n ni fi^^Q9 

u.uiuooy^ 




1 


1 ^ 


n 
u 


^91 DOD 

tJZi 1UUU 












1 




242875 










2 




01 741 fi7 




z. 
o 


7 


1 ^ 
J-tJ 


n 




7Q378zL 


70**^ 3^ 

U. ( c/OOOO 




10 


36 


o 


17851 


796874 


0.793335 


Q 


7 


1 ^ 

10 


n 
u 


1 nsnnnn 

lUOUUUU 


U.oo I yzo 


U.oo / OQ4L 








i 
i 




U. 1 lUOoO 


u. i iuyo / 








2 




00143426 


00138696 




1 n 


ou 


n 
u 


1 n^nnn 


U.OOOU1U 










± 




1 1 O^zLQ 










2 




00140196 

V 7 . V / V 1 L I V 7 _l '7 V 7 






7 




n 

u 


tJUOUUU 


u.yuuioj 


Q^QQ^R 








1 

1 






n 0^0007^ 
u.uoyyy / o 








2 




5.33808E-5 


6.40985E-5 




10 


36 





36000 


0.958667 










1 




0.0412778 










2 




5.55555E-5 




27 


7 


15 





947000 


0.962757 


0.962914 








1 




0.0371953 


0.0370352 








2 




4.75185E-5 


5.08724E-5 




10 


36 





89000 


0.962023 










1 




0.0378652 










2 




0.000112360 





Table 3: Distribution of | Jac[p°°](Fg)| for Jacobians of hy- 
pcrclliptic curves. 



q 


genus 


|Jac(F 9 )[p°°]| 


Sample Size 


Proportion 


C P (G) 


3 


12 


P 


20000 


0.280100 


0.280063 




20 


1 


1600000 


0.560527 


0.560126 




25 


1 


400000 


0.560198 


0.560126 


5 


8 


P 


5000 


0.193200 


0.190083 




15 


1 


1600000 


0.759874 


0.760333 


Continued on next page 
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Table 3 continued from previous page 



q 


genus 


| Jac(F,)| 


Sample Size 


Proportion 


C P (r) 




25 


1 


400000 


0.760077 


0.760333 


7 


15 


1 


1600000 


0.837408 


0.836796 


25 


1 


400000 


0.836867 


0.836796 


9 


12 


1 


274016000 


0.560169 


0.560126 


11 


15 


1 


1600000 


0.900908 


0.900833 


25 


1 


400000 


0.900988 


0.900833 


25 


12 


1 


269210000 


0.760389 


0.760333 



Table 4: Probability that p \ | Jac(F g ) | for Jacobians of plane 
curves. 



q 


degree 


genus 


Sample Size 


Proportion 


c P (D 


2 


7 


15 


37940000 


0.071762 


0.288788 




8 


21 


1777100 


0.229230 






9 


28 


313000 


0.000223 






10 


36 


349500 


0.246071 






11 


55 


312000 


0.0000064 




3 


7 


15 


137000 


0.553302 


0.560126 




10 


36 


142000 


0.559676 




4 


9 


28 


47775 


0.000000 


0.288788 




10 


36 


31200 


0.285673 




5 


7 


15 


350000 


0.760462 


0.760332 




10 


36 


48000 


0.761500 




8 


7 


15 


219725 


0.000000 


0.288788 




8 


21 


89575 


0.283349 




9 


10 


36 


57000 


0.558772 


0.560126 
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